(FILE 'HOME' ENTERED AT 09:36:26 ON 07 NOV 2003) 



FILE 'REGISTRY' ENTERED AT 09:36:41 ON 07 NOV 2003 
LI 2 S CGGTTTAATGGCTTGTTGTGCT/SQSN 

L2 2 S LI AND 15-30/SQL 

FILE 'CAPLUS' ENTERED AT 09:38:19 ON 07 NOV 2003 
L3 1 S L2 

L4 0 S LI AND 15-100/SQL 

FILE 'REGISTRY' ENTERED AT 09:39:48 ON 07 NOV 2 0 03 
L5 2 S LI AND 15-100/SQL 

L6 8 S ATGCCATTAAACCGGTGGC/ SQSN 

L7 2 S L6 AND 15-100/SQL 

FILE ' CAPLUS ' ENTERED AT 09:42:23 ON 07 NOV 2003 
L8 6 S L6 

FILE 'REGISTRY' ENTERED AT 09:46:05 ON 07 NOV 2003 

FILE 1 STNGUIDE 1 ENTERED AT 09:46:40 ON 07 NOV 2003 
L9 QUE CGGTTTAATGGCTTGTTGTGCT | AGCACAACAAGCCATTAAACCG | ATGCCATTAA 

FILE 'REGISTRY' ENTERED AT 09:53:36 ON 07 NOV 2003 
L10 14 S L9 

Lll 6 S L10 AND SQL<101 

FILE 1 CAPLUS ' ENTERED AT 09:54:58 ON 07 NOV 2003 
L12 1 S Lll 

L13 6 S L10 



= > 
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SEQ 27201 gcagtctgat gccattaaac cggtggctat tttccttgat ggctttgcgt 



HITS AT: 27209-27227 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
RN 405050-93-3 REGISTRY 

CN DNA (Escherichia coli strain 0157 :H7 clone OZID_244 fragment) (9CI) (CA 

INDEX NAME) 
OTHER NAMES : 

CN 242: PN : US6365723 SEQID: 242 claimed DNA 
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PATENT ASSIGNEE (S) : 
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DOCUMENT TYPE: 
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PATENT INFORMATION: 



2002 : 937301 CAPLUS 
138 :20541 

DNA and protein sequences of enterohemorrhagic E. coli 
0157 :H7 specific proteins and their used in diagnosis 
and therapeutics 

Hayashi, Hideo; Shinagawa, Hideo; Makino, Kozo; 

Hayashi, Tetsuya; Onishi, Makoto; Hattori, Shohei; 

Kurokawa , Ak i r a 

Tsukuba University, Japan 

Jpn. Kokai Tokkyo Koho, 2 067 pp. 

CODEN: JKXXAF 

Patent 

Japanese 

1 



PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



A2 



20021210 



JP 2002-15959 
JP 2001-112010 A 



20020124 
20010124 



JP 2002355074 
PRIORITY APPLN. INFO. 

AB This invention provides DNA and protein sequences of enterohemorrhagic E. 

coli 0157 :H7 specific proteins. The proteins are specifically present in 
E. coli 0157:H7 rather than in non-pathogenic E. coli K-12. The 0157:H7 

specific DNA and protein can be used in diagnosis and treatment of 

IT 



digestive 


track hemorrhage. 
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RL: BSU (Biological study, unclassified); DGN (Diagnostic use); PRP 
(Properties) ; THU (Therapeutic use) ; BIOL (Biological study) ; USES (Uses) 

(nucleotide sequence; DNA and protein sequences of enterohemorrhagic E. 

coli 0157 :H7 specific proteins and their used in diagnosis and 

therapeutics) 
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2002 :251953 CAPLUS 
136 :258372 

DNA sequences unique to the pathogenic 0157 :H7 strain 
of Escherichia coli 

Blattner, Frederick R . ; Burland, Valerie; Perna, 

Nicole T.; Plunkett, Guy; Welch, Rod 

Wisconsin Alumni Research Foundation, USA 

U.S. , 26 pp. 

CODEN : USXXAM 

Patent 

English 

1 



PATENT NO. 



KIND DATE 



APPLICATION NO. DATE 



US 6365723 Bl 
US 2003023075 Al 
PRIORITY APPLN. INFO.: 



20020402 US 1999-453702 19991203 

20030130 US 2002-114170 20020401 

US 1998-110955P P 19981204 
US 1999-453702 Al 19991203 
AB The entire genome of pathogenic Escherichia coli strain 0157 :H7 has been 

sequenced. All of the genomic DNA sequences present in 0157 and absent in 
the previously sequenced laboratory strain K12 are provided. These sequences 
correspond to 255 islands of 0157 DNA. This sequence information is 
needed for comprehensive efforts at detection, diagnosis, prophylaxis, and 
therapeutic approaches to infections caused by the organism. 



REFERENCE COUNT: 



THERE ARE 4 CITED REFERENCES AVAILABLE FOR THIS 
RECORD. ALL CITATIONS AVAILABLE IN THE RE FORMAT 



IT 



405050-88-6 
405050-93-3 

405050- 97-7 

405051- 02-7 
405051-07-2 
405051-12-9 



405050-89-7 
405050-94-4 

405050- 98-8 

405051- 03-8 
405051-08-3 
405051-14-1 



405050-90-0 
405050-95-5 

405050- 99-9 

405051- 04-9 
405051-09-4 



405050-91-1 

405050- 96-6 

405051- 00-5 
405051-05-0 
405051-10-7 



405050- 92-2 

405051- 01-6 
405051-06-1 
405051-11-8 



THU 



RL: BSU (Biological study, unclassified) ; PRP (Properties) ; 
(Therapeutic use) ; BIOL (Biological study) ; USES (Uses) 

(nucleotide sequence; DNA sequences unique to the pathogenic 0157 :H7 

strain of Escherichia coli) 
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ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE: 

AUTHOR (S) : 



CORPORATE SOURCE: 



SOURCE : 



CAPLUS COPYRIGHT 2 0 03 ACS on STN 
2001 :209011 CAPLUS 
137 : 196299 

Genome sequence of enterohaemorrhagic Escherichia coli 
0157 :H7. [Erratum to document cited in CA134 : 232542 ] 
Perna, Nicole T . ; Plunkett, Guy, III; Burtand, 
Valerie; Mau, Bob; Glasner, Jeremy D.; Rose, Debra J.; 
Mayhew, George F.; Evans, Peter S.; Gregor, Jason; 
Kirkpatrick, Heather A.; Postal, Gyorgy; Hackett, 
Jeremiah; Klink, Sara; Boutin, Adam,* Shao, Ying; 
Miller, Leslie; Grotbeck, Erik J.; Davis, N. Wayne; 
Lim, Alex; Dimalanta, Eileen T . ; Potamousis, 
Konstantinos D.; Apodaca, Jennifer; Anantharaman , 
Thomas S.; Lin, Jleyl; Yen, Galex; Schwartz, Dvaid C; 
Welch, Rodney A.; Blatner, Frederick R. 
Genome Center of Wisconsin, Department of Animal 
Health and Biomedical Sciences, Laboratory of 
Genetics, Department of Chemistry, Department of 
Biostatistics, and Department of Medical Microbiology 
and Immunology, University of Wisconsin, Madison, WI, 
53706, USA 

Nature (London, United Kingdom) (2001), 410(6825)/ 240 
CODEN: NATUAS; ISSN: 0028-0836 



PUBLISHER: 
DOCUMENT TYPE: 
LANGUAGE : 



Nature Publishing Group 
Journal 
English 

AB The correct GenBank accession number 
IT 318232-68-7, GenBank AE005653 31823 
318232-70-1, GenBank AE005655 31823 
318232-72-3, GenBank AE005657 31823 
318232-74-5, GenBank AE005659 31823 
318232-76-7, GenBank AE005661 31823 
318232-78-9, GenBank AE005663 31823 
318232-80-3, GenBank AE005665 31823 
318232-82-5, GenBank AE005667 31823 
318232-84-7, GenBank AE005669 31823 
318232-86-9, GenBank AE005671 
RL: BSU (Biological study, unclassifi 
(Biological study) 

(nucleotide sequence; genome seque 
coli 0157 :H7 (Erratum)) 



for the annotated sequence 
2-69-8, GenBank AE005654 
GenBank AE005656 
GenBank AE005658 
GenBank AE0 05 660 
GenBank AE005662 
GenBank AE005664 
GenBank AE005666 
GenBank AE005668 



is AE005174, 



2-71-2 , 
2-73-4, 
2-75-6, 
2-77-8, 
2-79-0, 
2-81-4, 
2-83-6, 



2-85-8, GenBank AE005670 
ed) ; PRP (Properties) ; BIOL 
nee of enterohemorrhagic Escherichia 
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AUTHOR (S) 



CORPORATE SOURCE: 

SOURCE : 

PUBLISHER: 
DOCUMENT TYPE: 
LANGUAGE : 
AB 



CAPLUS COPYRIGHT 2 003 ACS on STN 
2001 : 184271 CAPLUS 
134 :217892 

Complete genome sequence of enterohemorrhagic 
Escherichia coli 0157 :H7 and genomic comparison with a 
laboratory strain K-12 

Hayashi, Tetsuya; Makino, Kozo; Ohnishi, Makoto; 
Kurokawa, Ken; Ishii, Kazuo; Yokoyama, Kat sushi ; Han, 
Chang-Gyun; Ohtsubo, Eiichi; Nakayama, Keisuke; 
Murata, Takahiro; Tanaka, Masashi; Tobe, Toru; Iida, 
Tetsuya; Takami, Hideto; Honda, Takeshi; Sasakawa, 
Chihiro; Ogasawara, Naotake; Yasunaga, Teruo; Kuhara, 
Satoru; Shiba, Tadayoshi ; Hattori, Masahira; 
Shinagawa, Hideo 

Department of Microbiology, Miyazaki Medical College, 
Miyazaki, 899-1692, Japan 
DNA Research (2001), 8(1), 11-22 
COD EN : DARSE8; ISSN: 1340-2838 
Universal Academy Press 
Journal 
English 

Escherichia coli 0157 :H7 is a major food-borne infectious pathogen that 
causes diarrhea, hemorrhagic colitis, and hemolytic uremic syndrome. The 
complete chromosome sequence of an 0157 :H7 strain isolated from the Sakai 
outbreak is reported, and the results compared with the genome of a benign 
laboratory strain, K-12 MG1655. The chromosome is 5.5 Mb in size, 859 Kb 
larger 

than that of K-12. A 4.1 -Mb sequence highly conserved between the two 
strains is identified, which may represent the fundamental backbone of the 
E. coli chromosome. The remaining 1.4 -Mb sequence comprises of 
0157 :H7-specif ic sequences, most of which are horizontally transferred 
foreign DNAs . The predominant roles of bacteriophages in the emergence of 
0157 :H7 is evident by the presence of 24 prophages and prophage-like 
elements that occupy more than half of the 0157 :H7-specif ic sequences. 
The 0157 :H7 chromosome encodes 1632 proteins and 20 tRNAs that are not 
present in K-12. Among these, at least 131 proteins are assumed to have 
virulence-related functions. Genome-wide codon usage anal, suggested that 
the 0157 :H7-specif ic tRNAs are involved in the efficient expression of the 
strain-specific genes. A complete set of the genes specific to 0157 :H7 
presented here sheds new insight into the pathogenicity and the physiol. 
of 0157 :H7, and will open a way to fully understand the mol . mechanisms 
underlying the 0157 :H7 infection. 

REFERENCE COUNT: 62 THERE ARE 62 CITED REFERENCES AVAILABLE FOR THIS 

RECORD. ALL CITATIONS AVAILABLE IN THE RE FORMAT 

IT 328224-91-5, GenBank AP002550 328224-92-6, GenBank AP002551 
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RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 

(Biological study) 

(nucleotide sequence; complete genome sequence of enterohemorrhagic 
Escherichia coli 0157 :H7 and genomic comparison with a laboratory strain 
K-12) 
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Burland, 

: Rose , Debra J. ; 



CORPORATE SOURCE: 



CAPLUS COPYRIGHT 2003 ACS on STN 
2001:98372 CAPLUS 
134 :232542 

Genome sequence of enterohaemorrhagic Escherichia coli 
0157 :H7 

Perna, Nicole T.; Plunkett, Guy, III; 
Valerie; Mau, Bob; Glasner, Jeremy D. 
Mayhew, George F.; Evans, Peter S.; Gregor, Jason; 
Kirkpatrick, Heather A.; Posfai, Gyorgy; Hackett, 
Jeremiah; Klink, Sara; Boutin, Adam; Shao, Ying; 
Miller, Leslie; Grotbeck, Erik J.; Davis, N. Wayne; 
Lim, Alex; Dimalanta, Eileen T. ; Potamousis, 
Konstantinos D.; Apodaca, Jennifer; Anantharaman, 
Thomas S.; Lin, Jieyi; Yen, Glaex; Schwartz, David C; 
Welch, Rodney A.; Blattner, Frederick R. 
Genome Center of Wisconsin, Department of Animal 
Health and Biomedical Sciences, Laboratory of 
Genetics, Department of Chemistry, Department of 
Biostatistics , and Department of Medical Microbiology 
and Immunology, University of Wisconsin, Madison, WI , 
53706, USA 

Nature (London) (2001), 409(6819), 529-533 
CODEN: NATUAS; ISSN: 0028-0836 
Nature Publishing Group 
Journal 
English 

The bacterium Escherichia coli 0157 :H7 is a worldwide threat to public 
health and has been implicated in many outbreaks of hemorrhagic colitis, 
some of which included fatalities caused by hemolytic uremic syndrome. 
Close to 75,000 cases of 0157 :H7 infection are now estimated to occur annually 
in the United States. The severity of disease, the lack of effective 
treatment and the potential for large-scale outbreaks from contaminated 
food supplies have propelled intensive research on the pathogenesis and 
detection of E. coli 0157 :H7. The genome of E . coli 0157 :H7 was sequenced 
to identify candidate genes responsible for pathogenesis, to develop 
better methods of strain detection and to advance our understanding of the 
evolution of E. coli, through comparison with the genome of the 
non-pathogenic laboratory strain E. coli K-12. Lateral gene transfer found to 
be far more extensive than previously anticipated. In fact, 1387 new 
genes encoded in strain-specific clusters of diverse sizes were found in 
0157 :H7. These include candidate virulence factors, alternative metabolic 
capacities, several prophages, and other new functions - all of which 
could be targets for surveillance. 



SOURCE : 

PUBLISHER: 
DOCUMENT TYPE: 
LANGUAGE : 
AB 



REFERENCE COUNT: 

IT 318232-68-7, 
318232-70-1, 
318232-72-3, 
318232-74-5, 



3 0 THERE ARE 3 0 CITED REFERENCES AVAILABLE FOR THIS 

RECORD . ALL CITATIONS AVAILABLE IN THE RE FORMAT 
GenBank AE005653 318232-69-8, GenBank AE005654 
GenBank AE005655 318232-71-2, GenBank AE005656 
GenBank AE005657 318232-73-4, GenBank AE005658 
GenBank AE005659 318232-75-6, GenBank AE005660 



318232-76-7, GenBank AE005661 
318232-78-9, GenBank AE005663 
GenBank AEO 0 5665 
GenBank AE005667 
GenBank AE005669 



318232-80-3 , 
318232-82-5, 
318232-84-7, 



GenBank AE005662 
GenBank AE005664 
GenBank AEO 05 66 6 
GenBank AEO 05 66 8 
GenBank AE005670 



318232-77-8, 
318232-79-0, 
318232-81-4 , 
318232-83-6, 
318232-85-8, 

318232-86-9, GenBank AE005671 
RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(nucleotide sequence; genome sequence of enterohemorrhagic Escherichia 
COli 0157:H7) 
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2000 : 900842 CAPLUS 
134 :52236 

A DNA marker specific for Escherichia coli serotypes 
0157 :H7; 0157 : NM and 055 :H7 and its diagnostic use 
Chen, Shu; Xu, Renlin; Li, Jiping 
University of Guelph, Can. 



PCT Int. Appl. 

CODEN: PIXXD2 

Patent 

English 

1 



4 0 pp. 



PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



WO 2000077247 


Al 


20001221 




WO 2000-CA716 




20000614 
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ML, 


MR, 


NE, 


SN, 


TD, 


TG 








PRIORITY APPLN . 


INFO 












US 1999- 


139260P 


P 


19990615 







AB A novel DNA marker FAFLP (fluorescent amplified fragment length 

polymorphism) specific for E. coli serotypes 0157 :H7; 0157 :NM and 055 :H7 
is disclosed. The isolation of the marker allows the development of 
diagnostic assays that can be used to detect the serotypes in the sample. 
In particular, the marker can be used to prepare nucleic acid primers and 
nucleotide probes based on the sequence of the marker. 

REFERENCE COUNT: 12 THERE ARE 12 CITED REFERENCES AVAILABLE FOR THIS 

RECORD. ALL CITATIONS AVAILABLE IN THE RE FORMAT 

IT 313725-55-2 313725-56-3 

RL: PRP (Properties); THU (Therapeutic use); BIOL (Biological study); USES 
(Uses) 

(nucleotide sequence; a DNA marker specific for Escherichia coli 
serotypes 0157 :H7, 0157 :NM and 055 :H7 and diagnostic use) 
IT 313725-50-7 313725-51-8 313725-52-9 
313725-54-1 

RL: ARG (Analytical reagent use) ; PRP (Properties) ; ANST (Analytical 

study) ; USES (Uses) 

(nucleotide sequences of primers or probes; a DNA marker specific for 
Escherichia coli serotypes 0157 :H7, 0157 :NM and 055 :H7 and diagnostic 
use) 



TITLE Direct Submission 

JOURNAL Submitted (26- JUN-2000) Ken Kurokawa, Osaka University, Genome 

Information Research Center; 3-1, Yamadaoka, Suita, Osaka 565-0871, 
Japan (E-mail : ken@gen- inf o . osaka -u .ac.jp, 

URL:http : //www. gen -info. osaka -u. ac. jp/, Tel : 81 -6-6879-8365, 
Fax:81-6-6879-2047) 
COMMENT genome project. 

FEATURES Location/Qualifiers 
source 1 . . 168764 

/organism="Escherichia coli 0157 :H7" 
/mol_type= "genomic DNA" 
/strain= n 0157 :H7" 
/sub_strain="RIMD 0509952" 
/ db_xr e f = " t axon : 8 3 3 3 4 " 
gene 79 . .243 

/gene="ECs5227" 
CDS 79. .243 

/gene="ECs5227" 

/note="identical to B4250_ECOLI gi | 1790699 (Conserved in 

E.coli K-12) " 

/codon_start=l 

/evidence=not_experimental 

/transl_table=ll 

/product = "hypothetical protein" 

/protein_id="BAB38650 .1" 

/db_xref="GI :13364705" 

/ 1 ran s 1 a t i on= " MLS VLLKLL 1 1 L VLTPRAS TRLLAVRLGYLAVYSMNMS VRKL FR 

LSIFFVMIVR" 
gene 315. .569 

/gene="ECs5228" 
CDS 315. .569 

/gene="ECs5228" 

/note="similar to YJGJ_ECOLI gi | 1790700 percent identity 

92 in 84 aa (Conserved in E.coli K-12)" 

/codon_start=l 

/evidence=not_experimental 

/ trans l_table=ll 

/product = "hypothetical protein" 

/protein_id="BAB3 8 651 . 1" 

/db_xref ="GI : 13364706" 

/ t ranslat ion= "MVLEGIHSHDPQARDIAVQYYHAAETAIYDYIARLHPQSAQCVT 

DFMSTVMSGLSAKAREGHSLEQLCATAALAGEAIKTILKE " 
gene 705. .1166 

/gene="ECs5229" 
CDS 705. .1166 

/gene="ECs5229" 

/note="similar to YJGK_ECOLI gi | 1790701 percent identity 

98 in 153 aa (Conserved in E.coli K-12)" 

/codon_start=l 

/evidence=not_experimental 

/ trans l_table=ll 

/product = "hypothetical protein" 

/protein_id="BAB3 8652 . 1" 

/db_xref ="GI : 13 3 64707" 

/ trans 1 at i on= " MEKMI IGNI HNLQPWLPQELRQAIEHI KAHVTAETPKGKHD I EG 
NRLFYLISEDMTEPYEARRAEYHARYLDIQIVLKGQEGMTFSTQPAGTPDTDWLADKD 
I AFLPEGVDE KT V I LNEGDF WF YPGEVHKPLCAVGAPAQVRKAVVKMLMA " 



gene 1289. .3061 

/gene="ECs5230" 
CDS 1289. .3061 

/gene= n ECs5230 M 

/note="similar to part of (13-616 in 616 aa) YJGLJECOLI 

gi | 1790702 percent identity 82 in 590 aa" 

/codon_start=l 

/ evidence=not_experimental 

/trans l_table=ll 

/product^ "hypothetical protein" 

/protein_id= "BAB3 8653 . 1 " 

/db_xref="GI :13364708" 

/ trans lation= "MSKISDLNYSQHITLADNFKQKSEVLNTWRVGMNNFARNAGGQD 
NTRNILNPKTFLEFLVKIFTLGYVDFSKRSNEAGRNMMAHIESSSYIKNNDGSEIMKF 
VMNNPEGERADSPKVI IEISLSTITTMGTRQGHTAIIFPQPDGSTNRYERKSFERKDE 
SSLHLITNKVLACYQREANKEIARLLNNHQKLNNLQKLNNLQKLNNLQKLNNIQKLNN 
IQELNNSQELNNSQELNNSQELNNSQDLKNSQVSCKGSVDFTITDLLEKSLNNALLAI 
RNEHLLLMPHVCSESISYLLGENGILEEIDKLYELNDHGIDNDKEGNNEINDIMINLS 
HILIESLDDAKVNLTPVIHSMLMTFLELPYNNDVKILEWCFNKSMQYFDDSAKIEHAC 
SVINHINFRRDQSKVAETLFFNLDKEPYKNSPELQELIWKKLWYWDFNLSNREKTY 
LIQRIFNNVESLFNKVPVSILVNDIFMNDFFMKNTEMINWYFPRLLKSYEDEKIYFDK 
LGYNFNNKESNEEIMKNQPKDVIEEKLNNELKLRFRMMQTILKSEVNVSPFIDQQRLN 
TLNPPENLRIAIEKFGWKKKTITA" 

gene complement (3118 . .4122) 

/gene="ECs5231" 

CDS complement (3118 . .4122) 

/gene="ECs5231" 

/note="similar to ARGF_ECOLI gi | 1786469 percent identity 

87 in 334 aa, similar to ARGI_ECOLI gi | 1790703 percent 

identity 99 in 334 aa (Conserved in E.coli K-12)" 

/codon_start=l 

/ evidence=not_experimental 

/transl_table=ll 

/product="ornithine carbamoyltransf erase 1" 
/protein_id="BAB38654 .1" 
/db_xref="GI : 13364709" 

/ trans lation= "MSGFYHKHFLKLLDFTPAELNSLLQLAAKLKADKKSGKEEAKLT 
GKNIALIFEKDSTRTRCSFEVAAYDQGARVTYLGPSGSQIGHKESIKDTARVLGRMYD 
GIQYRGYGQEIVETLAEYAGVPVWNGLTNEFHPTQLLADLLTMQEHLPGKPFNEMTLV 
YAGDARNNMGNSMLEAAALTGLDLRLVAPQACWPEAALVTECRALAQQNGGNITLTED 
VAKGVEGADFIYTDVWVSMGEAKEKWAERIALLRDYQWSKMMQLTGNPEVKFLHCLP 
AFHDDQTTLGKKMAEEFGLHGGMEVTDEVFES AAS I VFDQAENRMHT I KAVMVATLS K 

M 

gene 4284. .4700 

/gene= M ECs5232" 
CDS 4284. .4700 

/gene="ECs5232 M 

/note="similar to YJGD_ECOLI gi | 1790704 percent identity 

100 in 138 aa (Conserved in E.coli K-12)" 

/ codon_start=l 

/evidence=not_experimental 

/ trans l__table= 11 

/product="hypothetical protein" 

/protein_id="BAB38655 .1" 

/db_xref="GI : 13364710" 

/ translation= "MANPEQLEEQREETRLIIEELLEDGSDPDALYTIEHHLSADDLE 
TLEKAAVEAFKLGYEVTDPEELEVEDGDIVICCDILSECALNADLIDAQVEQLMTLAE 



KFDVEYDGWGTYFEDPNGEDGDDEDFVDEDDDGVRH" 
gene complement (4746 . .5039) 

/gene="ECs5233" 
CDS complement (4746 . .5039) 

/gene= n ECs5233 M 

/note= M similar to B4256_EC0LI gi | 1790705 percent identity 

93 in 97 aa (Conserved in E.coli K-12) " 

/codon_start=l 

/ evidence=not_experimental 

/transl_table=ll 

/product= n hypothetical protein" 

/protein_id='*BAB38656 .1" 

/db_xref="GI :13364711" 

/translations "MVGGGGIAPLAGSESDICELQKMYFLPAIRGKGLAKKLALKAME 

EAREMGFKRCYLETTAFLKEAIGLYEHLGFQHIDYALGCTGHVDCEVRMLREL 1 ' 
gene 5442. .6638 

/gene= n ECs5234" 
CDS 5442. .6638 

/gene="ECs5234" 

/note= n similar to part (5-393 in 393 aa)of YJGN_ECOLI 

gi 1 2367367 percent identity 96 in 389 aa (Conserved in 

E.coli K-12) " 

/codon_start=l 

/ evidence=not_experimental 

/transl_table=ll 

/product= "hypothetical protein" 

/protein__id="BAB38657 . 1" 

/db_xref="GI : 13364712" 

/translations "MAQVINEMDVPSHSFVFHGTGERYFLICWNVLLTIITLGIYLP 
WALMKCKRYLYANMEVNGQRFSYGITGGNVFFSCLVFVFFYFAILMTVSADMPLIGCV 
LTLSLLVLLIFMAAKGLRYQALMTSLNGVRFSFNCSMKGVWWVTFFLPILMAIGMGTV 

Query Match 99.3%; Score 1572; DB 1; Length 168764; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1583; Conservative 0; Mismatches 0; Indels 1; Gaps 1; 



Qy 


1 


CTGCACCTTTTTTTGCTGTGCCTTTGGGATCCGCGCGAATGCGCCCAGAAAACCATCAGA 

IIIIIIIIIIIIIIIIMIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CTGCACCTTTTTTTGCTGTGCCTTTGGGATCCGCGCGAATGCGCCCAGAAAACCATCAGA 


60 


Db 


58317 


58258 


Qy 


61 


AAGTGCCACCTTGGGCGTTTGCACCGTCATTTGTCACCTCCGGACAGATGTTGCAATGTA 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

AAGTGCCACCTTGGGCGTTTGCACCGTCATTTGTCACCTCCGGACAGATGTTGCAATGTA 


120 


Db 


58257 


58198 


Qy 


121 


TTTTCGTCAATTGGCCCGAATGCAACGTGCCAACCCCTGGTTGTGAATAATGGAATAAAT 

IIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIII 

TTTTCGTCAATTGGCCCGAATGCAACGTGCCAACCCCTGGTTGTGAATAATGGAATAAAT 


180 


Db 


58197 


58138 


Qy 


181 


TCTTGATTATCAATAATCAATGCCCATTTTTGTGATGGCCAGGCGAGATCGGCTTCCGCA 

MIMIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

TCTTGATTATCAATAATCAATGCCCATTTTTGTGATGGCCAGGCGAGATCGGCTTCCGCA 


240 


Db 


58137 


58078 


Qy 


241 


ATAATTTCACCTTCATCATTTTGCAATTCGTAGCCGAGAGTCGGCGCAGAGAGAGAATGA 

IIIMIIIIIIMIIIIIIIIIIMMMMIIMIIIIMIIIIIIIIIIIIIMIIII 

ATAATTTCACCTTCATCATTTTGCAATTCGTAGCCGAGAGTCGGCGCAGAGAGAGAATGA 


300 


Db 


58077 


58018 



Qy 



301 GACTGAAGCAACGCGATCTCTTCAGCGCTTAATAAGCCAAACTCAATAATCTCAGCCCAA 36 0 



Db 58 017 GACTGAAGCAACGCGATCTCTTCAGCGCTTAATAAGCCAAACTCAATAATCTCAGCCCAA 57958 

Qy 3 61 CTCTCATCAGTCTCTGGTACTGCCTCTGGCGTTGCCGAAACTTTAGTGTCAACGCTTTCC 42 0 

IMIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

Db 57 957 CTCTCATCAGTCTCTGGTACTGCCTCTGGCGTTGCCGAAACTTTAGTGTCAACGCTTTCC 57898 

Qy 421 TGCGGTAAATGCACGGCTTTACGGCTGGTAAAGGTCATATCCGGCAGGAACTGTAACAGG 4 80 

I ! I ' I i I I I ! I . : I I I I . ' I I I ' . I . , I . I I I ' [ I ' | | 

Db 578 97 TGCGGTAAATGCACGGCTTTACGGCTGGTAAAGGTCATATCCGGCAGGAACTGTAACAGG 578 3 8 

Qy 4 81 TTTACCAGTCGCCAGAAGCCATTGAGTCCCGCTTCATAACCGTCATCCTGCGTATAGCGA 54 0 

MIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 57 83 7 TTTACCAGTCGCCAGAAGCCATTGAGTCCCGCTTCATAACCGTCATCCTGCGTATAGCGA 5777 8 

Qy 541 TCGTCAAAACAGATATGCAAACGCAGCCAGCTGCGCATTTGTGCGACGCTGGTCGTCGGT 6 00 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIMIIIIIIIIIIIIMIIII 

Db 5 7 777 TCGTCAAAACAGATATGCAAACGCAGCCAGCTGCGCATTTGTGCGACGCTGGTCGTCGGT 57718 

Qy 601 TTAATGGCTTGTTGTGGTAACACCGAAGCCAGCTCAATAAATTGCTGCGATGAGTTACAG 660 

II M 1 , 1 1 1 i 1 1 ' 1 1 1 1 1 1 ! 1 1 ;ll. 1 1 1 1 1 1 1 ! 1 1 . I 1 1 , 1 1 1 1 1 M I 1 1 1 

Db 5 7717 TTAATGGCTTGTTGTGGTAACACCGAAGCCAGCTCAATAAATTGCTGCGATGAGTTACAG 57 658 

Qy 661 CTATCGAGTAAACCACCAAAGACAAACGGTTCGTCTGGCAAAAGCTCTGCTAGCCGCCAA 72 0 

IIIMIIIIIIIMIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIMMMIIIIIII 

Db 57657 CTATCGAGTAAACCACCAAAGACAAACGGTTCGTCTGGCAAAAGCTCTGCTAGCCGCCAA 575 98 

Qy 721 GCTGGCGCGTTTTCCTGCATTTCATAGGCATACTTTTGTTTCGTTCCCGCGTCCTGTGAC 7 80 

IIIIMIIIIIIIIIIIIMIIIIMIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIII 

Db 575 97 GCTGGCGCGTTTTCCTGCATTTCATAGGCATACTTTTGTTTCGTTCCCGCGTCCTGTGAC 5753 8 

Qy 781 TTCTTCACATCCAACCACACCCAGGCATGTGCAGCAGCCATTCTCTGCCATAACTGCGTT 84 0 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIII 

Db 5753 7 TTCTTCACATCCAACCACACCCAGGCATGTGCAGCAGCCATTCTCTGCCATAACTGCGTT 57478 

Qy 841 CTTCCGCCCGGATCGGCGAGATAATCCAGCAGCAGTGCAAAACTGTTGCGTTCCCGGAAC 90 0 

MIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIMIIIIII 

Db 57477 CTTCCGCCCGGATCGGCGAGATAATCCAGCAGCAGTGCAAAACTGTTGCGTTCCCGGAAC 57418 

Qy 901 GACGCTTCAAGGGCCGCAAAGTTGGTATCATGAAACAGGTTATAAAACTTAGGCTGTTTC 960 

II : I I I I i I I I J I : J I I : I I 1 .' I E ! I I . : I I I I I I ' : I I I ! . I I : I 1 I I I ! I I I I I 
Db 57417 G ACGCTTCAAGGGCCGCAAAGTTGGTATC ATGAAACAGGTTATAAAACTTAGGCTGTTTC 573 58 

Qy 961 ATATCCGGGTTATGCCCAAGTCCCAGGACATCCTGAACATGTTTAAGCCCTGGCTCTTTG 102 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMMIIIIIIII 

Db 573 57 ATATCCGGGTTATGCCCAAGTCCCAGGACATCCTGAACATGTTTAAGCCCTGGCTCTTTG 572 98 

Qy 1021 AGATCGG - CCACGTGACTGTCCACACCCAGAAATTACCGCTATCCCGAATCGCCTGGCGT 107 9 

IMI IIMIMIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIII 

Db 572 97 AGATCGGCCCACGTGACTGTCCACACCCAGAAATTACCGCTATCCCGAATCGCCTGGCGT 57238 

Qy 108 0 TTTTGCACATCCTCTGCGACGCTATTTTTGTGGAACGCAAAGCCATCAAGGAAAATAGCC 113 9 

IIIMMIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 5 7237 TTTTGCACATCCTCTGCGACGCTATTTTTGTGGAACGCAAAGCCATCAAGGAAAATAGCC 57178 

Qy 114 0 ACCGGTTTAATGGCATCAGACTGCATAAGCGGATATAGCACGTAATCCGGGCGACAAGGG 119 9 

IIIIIIIIIIIIIIIIIIIIIIMIIIMMIIIIIIIMMIIIIIIIIIIIIIIIIII 



Db 57177 ACCGGTTTAATGGCATCAGACTGCATAAGCGGATATAGCACGTAATCCGGGCGACAAGGG 57118 

Qy 12 0 0 ATCCCCACCGCCTCTTTAGCGCCTAAATCCACCTGAGCTTTAAGATGCCAGCTCATCATC 12 5 9 

I M M M 1 1 M ! M M M II M I M M M M 1 1 M M i I II II II M 1 1 1 II I II I M 1 1 

Db 57117 ATCCCCACCGCCTCTTTAGCGCCTAAATCCACCTGAGCTTTAAGATGCCAGCTCATCATC 57058 

Qy 12 60 GGTTCGCTGCGGGTGTTAATAATCCACCCAGCGCCCTGATGCGCGTAACTGCGACTGACA 1319 

IMMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

Db 57 057 GGTTCGCTGCGGGTGTTAATAATCCACCCAGCGCCCTGATGCGCGTAACTGCGACTGACA 56998 

Qy 13 2 0 ATCAGGTTTTTATTTTCTTGC AGGCAGCTAATAAACCGTTTTTCCAGCTCACTGCCCATC 13 79 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIII 
Db 5 6997 ATCAGGTTTTTATTTTCTTGCAGGCAGCTAATAAACCGTTTTTCCAGCTCACTGCCCATC 56938 

Qy 13 80 ATGGCTTCGAGGGAAATGTTTTTGATGGAATCAATTACCCGTATCGAATCTCTTGCCTGC 14 3 9 

M I II II M I II M II M M M 1 1 1 1 M M M 1 1 1 M 1 1 M M I M I M II M 1 1 1 1 1 II 

Db 5 6937 ATGGCTTCGAGGGAAATGTTTTTGATGGAATCAATTACCCGTATCGAATCTCTTGCCTGC 56878 

Qy 144 0 AAAATTTTCGCCAGCAAGAGACGCGCCTGATCGCGGGAGACATACTTCATGCGCCCACGG 14 99 

IIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 5 6877 AAAATTTTCGCCAGCAAGAGACGCGCCTGATCGCGGGAGACATACTTCATGCGCCCACGG 56818 

Qy 15 0 0 TCACGGTAAGCGTAAACGCAACGATAACAACCATCTTTATGCGTATCATTGTTGCAACTG 155 9 

I i I I I I I I ' ; I I I : I I I i 1 , I i I I I : ; I ; I ! I I I I ' I ' I I I 

Db 5 6817 TCACGGTAAGCGTAAACGCAACGATAACAACCATCTTTATGCGTATCATTGTTGCAACTG 56758 

Qy 1560 CATTCCACTATTGCCTGATAGGCC 15 8 3 

MIIIIIIIIIIIIIMIIIIIII 

Db 56757 CATTCCACTATTGCCTGATAGGCC 56734 




RESULT 1 

US- 09-453 -702B- 242 /c 

; Sequence 242, Application US/09453702B 
; Patent No. 6365723 

GENERAL INFORMATION: 

APPLICANT: Blattner, Frederick R. 
; Bur land, Valerie 

; Perna, Nicole T. 

; Plunkett, Guy 

Welch, Rod 

TITLE OF INVENTION: No. 6365723el Sequences of E. coli 0157 
NUMBER OF SEQUENCES: 2 65 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Quarles & Brady 

STREET: 1 South Pinckney Street 

CITY: Madison 

STATE: WI 

COUNTRY: US 

ZIP: 53701-2113 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 3.50 inch. 1.44Mb storage 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE : Word Perfect 8 . 0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/4 53 , 7 02B 

FILING DATE: 03 -Dec- 1999 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/110,955 

FILING DATE: 04-DEC-1998 
ATTORNEY/ AGENT INFORMATION: 

NAME: Seay, Nicholas J. 

REGISTRATION NUMBER: 273 86 

REFERENCE/ DOCKET NUMBER: 9602 96.95 017 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (608) 251-5000 

TELEFAX: (608) 251-9166 
INFORMATION FOR SEQ ID NO: 242: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 31880 
; TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
SEQUENCE DESCRIPTION: SEQ ID NO: 242: 
US-09-453-702B-242 

Query Match 100.0%; Score 96; DB 4; Length 31880; 

Best Local Similarity 100.0%; Pred . No. 2.7e-26; 

Matches 96; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 TATCCCGAATCGCCTGGCGTTTTTGCACATCCTCTGCGACGCTATTTTTGTGGAACGCAA 6 0 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIMIIIIIIIIIIIMIIIIIIIII 

Db 2 7304 TATCCCGAATCGCCTGGCGTTTTTGCACATCCTCTGCGACGCTATTTTTGTGGAACGCAA 2 7 245 



Qy 



61 AGCCATCAAGGAAAATAGCCACCGGTTTAATGGCAT 96 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

Db 2 7244 AGCCATCAAGGAAAATAGCCACCGGTTTAATGGCAT 27209 



^onp- Sep 5~ 



RESULT 1 

US-09-453-702B-242/C 

; Sequence 242, Application US/09453702B 
; Patent No. 6365723 

GENERAL INFORMATION: 

APPLICANT: Blattner, Frederick R. 

Burland, Valerie 
; Perna, Nicole T. 

; Plunkett, Guy- 

Welch, Rod 

TITLE OF INVENTION: No. 6365723el Sequences of E . coli 0157 
NUMBER OF SEQUENCES: 265 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Quarles & Brady 

STREET: 1 South Pinckney Street 

CITY: Madison 

STATE: WI 

COUNTRY: US 

ZIP: 53701-2113 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 3.50 inch. 1.44Mb storage 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Word Perfect 8.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/453 , 702B 

FILING DATE: 03-Dec-1999 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/110,955 

FILING DATE: 04-DEC-1998 
ATTORNEY /AGENT INFORMATION: 

NAME: Seay, Nicholas J. 

REGISTRATION NUMBER: 2 7386 

REFERENCE/DOCKET NUMBER: 960296.95017 
TELECOMMUNICATION INFORMATION: 

TELEPHONE : (608) 251-5000 

TELEFAX: (608) 251-9166 
INFORMATION FOR SEQ ID NO: 242: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 318 8 0 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
SEQUENCE DESCRIPTION: SEQ ID NO: 242: 
US-09-453-702B-242 

Query Match 100.0%; Score 81; DB 4; Length 31880; 

Best Local Similarity 100.0%; Pred. No. l.le-21; 

Matches 81; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 CGGTTTAATGGCTTGTTGTGGTAACACCGAAGCCAGCTCAATAAATTGCTGCGATGAGTT 6 0 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 2 7 768 CGGTTTAATGGCTTGTTGTGGTAACACCGAAGCCAGCTCAATAAATTGCTGCGATGAGTT 2 7709 



Qy 



61 ACAGCTATCGAGTAAACCACC 81 



2 7708 ACAGCTATCGAGTAAACCACC 27688 



